Disambiguating Personal Names on the Web Using Automatically Extracted Key Phrases

Authors

  • Danushka Bollegala
  • Yutaka Matsuo
  • Mitsuru Ishizuka
Abstract

When you search the web for information about a particular person, a search engine returns many pages, some of which may be about other people with the same name. How can we disambiguate these people? This paper presents an unsupervised algorithm that produces unique phrases to disambiguate different people with the same name (i.e., namesakes). The algorithm takes a personal name as input and outputs multiple sets of phrases that uniquely identify the different namesakes on the web. These phrases can then be added to the query to narrow the search to a specific namesake. We evaluated the algorithm on a collection of documents retrieved from the web. Experimental results show a significant improvement over existing methods proposed for this task.
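The input/output behaviour described in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes the documents have already been grouped by namesake, it works on single lowercase tokens rather than multi-word phrases, and the function names (`distinguishing_terms`, `refine_query`) and the sample documents are hypothetical, introduced only for illustration.

```python
from collections import Counter


def distinguishing_terms(clusters, top_k=2):
    """For each cluster, return up to top_k tokens that occur in no other cluster.

    Assumption: `clusters` maps a namesake label to its list of documents;
    how that grouping is obtained is outside the scope of this sketch.
    """
    # Document frequency of each lowercase token, per cluster.
    doc_freq = {
        label: Counter(tok for doc in docs for tok in set(doc.lower().split()))
        for label, docs in clusters.items()
    }
    result = {}
    for label, counts in doc_freq.items():
        # Tokens seen in any other cluster cannot identify this one uniquely.
        others = set()
        for other_label, other_counts in doc_freq.items():
            if other_label != label:
                others.update(other_counts)
        unique = [(tok, n) for tok, n in counts.items() if tok not in others]
        # Most frequent first; ties broken alphabetically for determinism.
        unique.sort(key=lambda item: (-item[1], item[0]))
        result[label] = [tok for tok, _ in unique[:top_k]]
    return result


def refine_query(name, phrases):
    """Append identifying phrases to a quoted name query."""
    return " ".join([f'"{name}"'] + [f'"{p}"' for p in phrases])


# Hypothetical toy data: two namesakes sharing the name "John Smith".
clusters = {
    "a": ["John Smith chemistry professor", "professor John Smith chemistry lecture"],
    "b": ["John Smith jazz drummer", "John Smith drummer tour"],
}
terms = distinguishing_terms(clusters)
print(terms)                                  # {'a': ['chemistry', 'professor'], 'b': ['drummer', 'jazz']}
print(refine_query("John Smith", terms["a"]))  # "John Smith" "chemistry" "professor"
```

The name itself never appears among the selected terms, because it occurs in every cluster and so cannot separate one namesake from another.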


Similar resources

Identifying People on the Web through Automatically Extracted Key Phrases

Assume that we are looking for information about a particular person. A search engine returns many pages for that person’s name. Some of these pages may be about other people with the same name. How can we distinguish the results for the person we are interested in from the others? A simple but effective solution is to add a phrase to the query that uniquely identifies the person we are inter...


Exploring Key Phrases for Browsing an Online News Feed in a Mobile Context

This paper describes ongoing work on how to automatically identify and use key phrases extracted from items of a news feed available on the Internet. These phrases are used for two different tasks: users of mobile devices (e.g., cellular phones and personal digital assistants) will be able to subscribe to news in different categories, where the categorisation of the news is based on the extract...


Semantic Search: from Names and Phrases to Entities and Relations

Web search is traditionally limited to keyword queries. In the era of Big Data and the Web of Linked Data, one would expect that schema-free search over both text and structured key-value pairs becomes more semantic. Systems should, for example, identify entities in queries and return crisp answers referring to facts, other entities and relationships. Some of these desired advances are happenin...


Extracting Key Phrases to Disambiguate Personal Names on the Web

When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. How can we disambiguate these different people with the same name? This paper presents an unsupervised algorithm which produces key phrases for the different people with the same name. These key phrases could be used to further n...


Automatically Extracting Personal Name Aliases from the Web

An entity can be referred to by multiple name aliases on the web. Extracting aliases of an entity is important for various tasks such as identification of relations among entities, automatic metadata extraction and entity disambiguation. To extract relations among entities properly, one must first identify those entities. Aliases of an entity are useful as metadata for that entity and can be used ...



Publication date: 2006